Abstract

We present a formal model for studying fashion trends, in terms of three parameters of fashionable 
items: (1) their innate utility; (2) individual boredom associated with repeated usage of an item; and 
(3) social influences associated with the preferences from other people. While there are several works 
that emphasize the effect of social influence in understanding fashion trends, in this paper we show how 
boredom plays a strong role in both individual and social choices. We show how boredom can be used 
to explain the cyclic choices in several scenarios such as an individual who has to pick a restaurant to 
visit every day, or a society that has to repeatedly 'vote' on a single fashion style from a collection. We 
formally show that a society that votes for a single fashion style can be viewed as a single individual 
cycling through different choices. 

In our model, the utility of an item gets discounted by the amount of boredom that has accumulated 
over the past; this boredom increases with every use of the item and decays exponentially when not used. 
We address the problem of optimally choosing items for usage, so as to maximize overall satisfaction,
i.e., composite utility, over a period of time. First we show that the simple greedy heuristic of always 
choosing the item with the maximum current composite utility can be arbitrarily worse than the optimal. 
Second, we prove that even with just a single individual, determining the optimal strategy for choosing 
items is NP-hard. Third, we show that a simple modification to the greedy algorithm that simply doubles 
the boredom of each item is a provably close approximation to the optimal strategy. Finally, we present 
an experimental study over real-world data collected from query logs to compare our algorithms. 



1 Introduction 



When an individual or a society is repeatedly presented with multiple substitutable choices, such as different
colors of cars or different themes of musicals, we often observe a recurring shift of preferences over time,
commonly known as fashion trends. While some trends are relatively easy to explain (e.g., sweater
sales increasing in the winter), some other trends may result from a variety of factors. In this paper, we 
first describe a utility model which we think may explain such trends. Then we study the computational 
issues under the model and provide simple mechanisms by which consumers may make close to optimal 
decisions on which products to consume and when, in order to maximize their overall utility. We then 
conduct experiments to show how various parameters in our model can be estimated and to validate our 
algorithm. 

Understanding fashion trends is of significant academic interest as well as commercial importance
in various fields, including brand advertising and market economics. Therefore, there is a large body of
work in multiple disciplines, including sociology (e.g. H 0), economics (e.g. [7]), and marketing (e.g. [11, 12]),
on theories for the evolution of fashion. Despite much study, there is a lack of a well accepted theory. This
is probably not surprising as what makes us like or dislike an alternative and how that changes over time 
involves economical, psychological, and social factors. Next we describe three such factors that influence 
fashion. 

First, and perhaps the most basic, cause of a product becoming trendy is its utility, intuitively capturing 
the value it adds to an individual. We call this the innate utility of a product. Second, psychologically, a 
person's utility from consuming a product may be discounted by constant consumption of the same item: as
one gets tired of existing products, one desires new and different ones. Third, while at an individual level we
have certain inclinations based on our tastes, these are influenced by social phenomena, such as what we see
around us and the preferences of friends and celebrities.

In this paper, we present a formal model that unifies the aforementioned three broad categories of factors 
using innate utility, individual boredom, and social influence (as depicted in Figure 1 and explained below).
We attempt to construct a mathematical model for these factors and use the model to explain the formation 
of fashion trends. We use the term item to denote any product, good, concept, or object whose fashion trend 
we are interested in. 

1. Innate utility: The utility of an item captures the innate value the item provides to an individual. We 
assume it is fixed, independent of other influences. 

2. Individual boredom: If we use any item for too long, we get bored of it, and our appreciation for it 
goes down. This is modeled as a negative component added to the utility. This factor grows if one 
repeatedly uses the same item and fades away when one stops consuming the item. 

3. Social influence: Our valuation of an item can be changed significantly by the valuations of our friends or
other influential people. For example, when we see that many people around us like something we may
start liking it; or we may consciously want to differ from some other people around us. We model
such influences as a weighted linear combination of the valuations of other people.¹

To model boredom on any item at any given time $t$, we associate with each past usage of the item, say at
time $t'$, a factor of the form $(1 - r)^{t - t'}$ for some $0 < r < 1$. The total boredom on the item is then the sum
of this factor over all past usages of the item. This definition captures the intuition that the boredom
grows if an item is repeatedly used. As we show in our experiments, such an exponential decay model matches
people's interest in songs and movies well. Utility maximization under this model, albeit NP-hard,
naturally displays cyclic patterns. We also provide a simple strategy that achieves near-optimal utility when
the decay factor $r$ is small.

The effect of influence can be formalized using a linear model. For example, to model social influence,
consider one item and a society consisting of $m$ people. Let $G$ denote the directed influence graph on these
people, where each edge is labeled with a weight that indicates the strength of the influence. A
high value on an edge, such as on outgoing edges from celebrities, indicates a strong outgoing influence; a
negative value indicates a desire to distance oneself or be different from the source node. Let $A$
denote the corresponding influence matrix, let $u_i(t)$ denote the utility of the item to the $i$th person, and let $u(t)$
denote the vector of utilities. If we assume that at each time step the influences from all friends of a person
add linearly, then we may write $u(t + 1) = A u(t)$, which is similar to [11]. Note that for stability of these
iterated powers, we should assume that the top eigenvalue of $A$ has magnitude 1. We will show that under this
influence model, we may treat the society as an individual making choices under the effect of boredom.

Figure 1: Factors influencing fashion trends

¹ Additional influence may come from the association of a product to things/concepts we like or dislike. For instance, someone
may be very fond of green technologies or dislike things that are scary. We may simply model such concepts as individuals.

Discussion of our results. We argue that fashion trends can be viewed as not just the effect of the influence 
of a privileged few but more as a democratic process that churns the social boredom and channels the innate 
instinct for change. Boredom is the innate psychological force that dulls the effect of a constant stimulus
over a period of time and makes us look for newer stimuli. It is well known that the mind tends to grow
oblivious to almost all types of sensations (visual, olfactory, touch, sound) to which it is exposed for a long 
time. Thus the 'coolness' of a fashionable item drops over time and things that we haven't seen or used in a 
long time begin to appear more 'cool'. 

We show how several scenarios involving individual and social choices are essentially driven by the
same underlying principles. The individual choice may be as simple as choosing a restaurant to visit on
a particular day. Alternatively, it may be a social choice where the market forces of a society 'choose'
different fashions such as styles for clothing or cars. Or a news channel may be picking the front page news
article to maximize readership, choosing among different types of news articles, e.g., politics, natural
disasters, celebrity gossip. Each news item may be popular or fashionable for a period of time; then
boredom sets in and the media may switch focus to a different event, probably of an entirely different type.
Boredom is thus perhaps the simplest single explanation for oscillations in individual and social choices.
This is not at all surprising; indeed, boredom is perhaps a strong influence when we make choices such as
food, clothes, fashions, governments. Social influence no doubt plays a large part in individual choices. But
when we look at the social system as a whole, the influences across individuals are forces within the system,
and the net effect is simply to give a larger voice to the more influential individuals. We also note that
influence by itself is not sufficient to create fashion cycles. In fact, if all influences are positive, then without
any boredom the system $u(t + 1) = A u(t)$ converges to a fixed value, resulting in a fixed fashion choice.

Finally, we recognize that the factors we consider are by no means comprehensive; several other 'external'
factors may change the values of nodes. For example, a shortage of oil may increase the utility of green
technologies, the strength of the edges in the graph may change, and the structure of the graph may change with
the formation of new nodes and edges. Our decay model for boredom and linear model for influence may be too
simplistic. Nonetheless, we believe the influence graph and boredom capture several important aspects of
the underlying psychological processes that people use to value items.

Outline of the paper. All the main theoretical results of this paper are presented in Section 2, with
proofs appearing in Section 3. Section 4 presents detailed experimental results for validating our model and
algorithms; our experiments use real-world data from Google Trends [1] on the popularity of songs and
movies in the last 3 years. Related work is presented next and we conclude in Section 5.

Related work. There are several theories of fashion evolution in various communities, e.g., sociologists
have modeled fashion trends as a collection of several social forces such as differentiation, influence, and
association. While there have been several explanations of cycles in fashion trends [3, 4, 5, 7, 12], most past
work does not offer a formal study. We compare our work with one notable exception [11] next. The focus
of our paper is on understanding the impact of various factors (boredom, association rules, and utility) on
the fashion choices made by individuals and a society. We explain the existence of cycles based on our
formal model of fashion, and provide algorithms for making optimal choices.

Reference [11] proposed a formal model of fashion based on association rules. Intuitively, an individual's
utility for an item is affected by how similar it is to items he likes, and how dissimilar it is to items
he dislikes. Further, he is influenced by the society through other individuals' preferences for various items.
Consider a single item, whose consumption vector is given by $c(t)$ at time $t$. Considering the recurrence
$c(t+1) = W c(t)$, where $W$ is the weighted influence matrix, [11] observed that if the matrix $W$ has a complex
top eigenvalue (corresponding to negative influences), then the item's consumption pattern may be periodic,
producing cycles in preferences. Our model of utility is similar to the consumption model in [11]. However,
we consider an additional parameter of boredom that is essential to explain fashion cycles in a society with
non-negative influences, as such a matrix $W$ always has a real top eigenvalue.

Some other recent work (e.g., [6]) studies behavioral influences in social networks, such as in terms
of information propagation. For instance, El studies how two competing products spread in society, [9]
provides techniques for tracking and representing "memes", which may be used to analyze news cycles,
and [10] studies how recommendations propagate in a network through social influence.

The focus of our paper is on formalizing a practical theory for fashion trends with boredom, combined 
with utility and a simple social behavior. Therefore, for a large part of the paper we consider only a single 
individual and study fashion trends based on boredom, and utility. Further, in our extension to multiple 
individuals, we assume a linear weighting of influences from friends' preferences for particular items. 

2 Contributions of Our Study 
2.1 Modeling individual boredom 

We consider a user living in discrete time periods 0, 1, . . . and consuming one item among n substitutable 
items at each time; for example, a person needs to decide which restaurant to go to every night or which 
political party to vote for every four years. We assume that each item $i$ brings a base utility $v_i$ to the user.
Now if we assume that the utilities are fixed, then the user would always choose the same item with the
maximum $v_i$. This would be inconsistent with the observed common behavior of cycling among multiple
items, which we refer to as fashion cycles. In order to explain fashion cycles, it is necessary to model the 
utility dependence of the consumptions across different time periods. 

We propose a simple model in which the utility of an item at any time t is the base utility discounted by 
a boredom factor proportional to the "memory" the person has developed by using this item in the past. The 
more the user has used the item, the more memory and boredom is developed for the item, and consequently 
the less utility the item has to the user. 

We naturally assume that the memory drops geometrically over time, and the total memory of a person
is bounded. This leads to the following definition of memory. Let $0 < r < 1$ be a memory decay rate, i.e.,
the rate at which a person "forgets" about things. Let $x_i(t) \in \{0, 1\}$ indicate if the user uses the item $i$ at
time $t$. Then the memory of $i$ at time $t$ is $M_i(t) = r \sum_{\tau=0}^{t} x_i(\tau)(1 - r)^{t - \tau}$. We add the factor $r$ so that
$M_i(t) \le 1$. The boredom $b_i(t) = \alpha_i M_i(t)$ is proportional to the memory and depends on the item. The
utility of item $i$ is defined as $u_i(t) = v_i - b_i(t) = v_i - \alpha_i M_i(t)$. Henceforth, we will refer to $v_i$ as the base
utility and $\alpha_i$ as the boredom coefficient.
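
As a minimal sketch of the definitions above, reading the memory as $M_i(t) = r \sum_{\tau \le t} x_i(\tau)(1 - r)^{t - \tau}$, the following Python computes the memory and utility of a single item from its usage history. The function names and the example numbers are illustrative assumptions, not part of the model.

```python
def memory(uses, r):
    """Memory M(t) at t = len(uses) - 1 for one item.

    uses[tau] is 1 if the item was used at time tau, else 0 (the x(tau) of the text).
    """
    t = len(uses) - 1
    return r * sum(x * (1 - r) ** (t - tau) for tau, x in enumerate(uses))


def utility(v, alpha, uses, r):
    """Utility u(t) = v - alpha * M(t): base utility minus boredom."""
    return v - alpha * memory(uses, r)


# Hypothetical item: base utility 10, boredom coefficient 4, decay rate 0.5.
used_three_days = [1, 1, 1]
rested_two_days = [1, 0, 0]
print(memory(used_three_days, 0.5))           # 0.875
print(utility(10, 4, used_three_days, 0.5))   # 6.5
print(utility(10, 4, rested_two_days, 0.5))   # 9.5
```

Repeated use pushes the memory toward its upper bound of 1, while two days of rest let most of the boredom fade.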
2.2 Utility optimization with boredom

With the above model, one natural question is to compute the choices of the items to maximize the user's
overall utility. If we allow the user to choose in continuous time, the maximization problem becomes relatively
easy, as the best way to consume an item is to do it cyclically at regular time intervals. However, such a
regular placement may not be realizable or may be hard to find. As we will show below, it is NP-hard to compute
the best consumption sequence.

We also consider the natural greedy strategy and show that under the greedy strategy, the utility of
each item is always bounded in a narrow band and so each item is consumed approximately cyclically. The
greedy strategy, however, may produce a sequence giving poor overall utility. We provide a simple
heuristic, called the double-greedy strategy, and show that it emulates the cyclic pattern of the optimal solution
on the real line and yields utility close to the optimal when $r$ is small.

2.2.1 Greedy algorithm 

In the greedy strategy, at each time $t$, the user consumes the item with the maximum utility $u_i(t)$. This
strategy is intuitive and probably consistent with how we make our daily decisions. We show that the utility
gap between any two items is small at all times. We also provide an example showing that it can perform poorly in
terms of utility maximization. Denote $\alpha = \max_i \alpha_i$.

Theorem 2.1. There exists a time $T$ such that for any $t > T$, $u \le \max_i u_i(t) \le u + O(r\alpha \log n)$, where $u$
is the unique solution to the following system:

For all items with $f_i > 0$, the quantity $v_i - \alpha_i f_i = u$; if $f_i = 0$ then $v_i \le u$; and $\sum_i f_i = 1$.
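
The system in Theorem 2.1 can be solved numerically. The sketch below assumes every boredom coefficient is strictly positive and uses bisection, relying on $f_i = \max((v_i - u)/\alpha_i, 0)$ and on the fact that $\sum_i f_i$ is decreasing in $u$. The function name `solve_threshold`, the iteration count, and the sample inputs are hypothetical choices for illustration.

```python
def solve_threshold(v, alpha, iterations=100):
    """Find u with sum_i max((v_i - u) / alpha_i, 0) = 1, assuming all alpha_i > 0."""

    def total_frequency(u):
        return sum(max((vi - u) / ai, 0.0) for vi, ai in zip(v, alpha))

    # At lo, the item maximizing v_i - alpha_i alone contributes 1, so the sum is >= 1.
    lo = max(vi - ai for vi, ai in zip(v, alpha))
    # At hi, every term is zero, so the sum is 0 < 1.
    hi = max(v)
    for _ in range(iterations):
        mid = (lo + hi) / 2
        if total_frequency(mid) >= 1.0:
            lo = mid
        else:
            hi = mid
    u = (lo + hi) / 2
    f = [max((vi - u) / ai, 0.0) for vi, ai in zip(v, alpha)]
    return u, f


u, f = solve_threshold([10.0, 6.0], [10.0, 10.0])
print(round(u, 6), [round(x, 6) for x in f])  # 3.0 [0.7, 0.3]
```

In this example both items are used, with frequencies 0.7 and 0.3, and both sit at the common utility level $u = 3$.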

While the greedy algorithm has the nice property of keeping the utility gap between any items small, it 
may produce a sequence with poor overall utility. 

Observation 2.1. The Greedy strategy of always picking the highest utility item each day is not optimal. 

To see the non-optimality of greedy, simply consider two items for beverage, say "water" and "soda". 
Assume water has a low base utility, say 1, that never changes, and zero boredom coefficient. Soda on the other
hand has a high base utility, say 10, but also a high boredom coefficient, say 10. So if one drank soda every day its
utility would drop to below that of water. Observe that the greedy strategy will choose soda till its utility 
drops to that of water and then it is chosen whenever its utility rises even slightly over 1. So the average 
utility of the greedy strategy is close to 1. A smarter strategy is to hold off on the soda even if it is a better 
choice today so as to enjoy it even more on a later day. Thus it is possible to derive an average utility that 
is much higher than 1. For example, we can get average utility of about 3 by alternating between water and 
soda in the above example. Note that the greedy algorithm produces poor performance in the above example 
even for small r. 
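
The water/soda example can be checked with a small simulation. This is a sketch under stated assumptions: utilities at time $t$ are evaluated with the memory accumulated from strictly earlier uses, ties go to the lower-indexed item, and the decay rate $r = 0.01$ and the horizon of 20000 steps are arbitrary illustrative choices.

```python
def average_utility(v, alpha, r, choose, steps):
    """Average utility of a selection rule under the boredom model.

    choose(t, utilities) returns the index of the item consumed at time t.
    """
    mem = [0.0] * len(v)
    total = 0.0
    for t in range(steps):
        utilities = [vi - ai * mi for vi, ai, mi in zip(v, alpha, mem)]
        pick = choose(t, utilities)
        total += utilities[pick]
        # All memories decay by (1 - r); the consumed item gains r.
        mem = [(1 - r) * mi for mi in mem]
        mem[pick] += r
    return total / steps


def greedy(t, utilities):
    # Item with the highest current utility (first index wins ties).
    return max(range(len(utilities)), key=lambda i: utilities[i])


def alternate(t, utilities):
    return t % 2  # water on even days, soda on odd days


v, alpha = [1.0, 10.0], [0.0, 10.0]  # item 0 = water, item 1 = soda
greedy_avg = average_utility(v, alpha, 0.01, greedy, 20000)
alternate_avg = average_utility(v, alpha, 0.01, alternate, 20000)
print(round(greedy_avg, 2), round(alternate_avg, 2))
```

With these settings the greedy average should stay close to 1 while simple alternation should come out near 3, in line with the discussion above.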

This naturally raises the question: what is the optimal strategy? More importantly is there an optimal 
strategy that is a simple 'rule of thumb' that is easy to remember and employ as we make the daily choices. 
Unfortunately it turns out that computing the optimal strategy is NP-hard. 

2.2.2 NP-hardness 

Theorem 2.2. Given a period T, target utility U*, and n items, it is NP-hard to determine whether there 
exists a selection of items with period T such that the total utility of the selection is at least U*. 

2.2.3 Double-greedy algorithm 

On the positive side we show that there is indeed a simple "rule of thumb" that gives an almost optimal
solution when $r$ is small. The strategy "double-greedy" waits longer for items that we get bored of too
quickly. It is a simple twist on the greedy strategy: instead of picking the item that maximizes the utility
$u_i(t) = v_i - b_i(t)$, it picks the one which maximizes $w_i(t) = v_i - 2b_i(t)$. Thus it doubles the boredom of
all items and then runs the greedy strategy. We show that:

Theorem 2.3. Let $U$ denote the average utility obtained by the double-greedy algorithm and $U^*$ the optimal
utility. Then $U \ge U^* - O(r\alpha \log n)$, where $\alpha = \max_i \alpha_i$.

We note that when $r \to 0$, the utility produced by double-greedy is close to the optimal solution.
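
A minimal simulation sketch of double-greedy on the water/soda example of Section 2.2.1 follows. It assumes that the score at time $t$ uses memory from strictly earlier uses, that ties go to the lower index, and that $r = 0.01$ is a representative small decay rate; all of these are illustrative choices. For comparison, spending a fraction $f$ of the time on soda at memory close to $f$ yields about $f(10 - 10f) + (1 - f)$, which is maximized at $f = 0.45$ with value 3.025.

```python
def double_greedy_average(v, alpha, r, steps):
    """Average true utility when picking argmax of w_i = v_i - 2 * alpha_i * M_i."""
    mem = [0.0] * len(v)
    total = 0.0
    for _ in range(steps):
        scores = [vi - 2 * ai * mi for vi, ai, mi in zip(v, alpha, mem)]
        pick = max(range(len(v)), key=lambda i: scores[i])
        # The user experiences the true utility v_i - alpha_i * M_i, not the score.
        total += v[pick] - alpha[pick] * mem[pick]
        # All memories decay by (1 - r); the consumed item gains r.
        mem = [(1 - r) * mi for mi in mem]
        mem[pick] += r
    return total / steps


# Item 0 = water (v = 1, alpha = 0), item 1 = soda (v = 10, alpha = 10).
avg = double_greedy_average([1.0, 10.0], [0.0, 10.0], 0.01, 20000)
print(round(avg, 2))
```

The printed average should land near the value 3.025 computed above, far above the roughly 1 that plain greedy achieves on the same example.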
2.3 Fashion as a Social Choice 

A choice is a fashion, if it is the choice of a large fraction of the society. Thus a society only supports a 
small number of fashions. Industries often target one type of fashion for each market segment. Consider a 
situation where the entire society consists of one fashion market segment. We will see how in this case such 
a society can be compared to an individual making choices to maximize utility under the effect of boredom. 
Each individual's utilities depend not only on his base utility and boredom but also on the influence from
other individuals.

Consider a society of $n$ people and $m$ possible item choices. The society needs to choose one item
out of these at every time step. We will study the problem of making the optimal choice so as to
maximize welfare. This is applicable in the following scenarios: a business is launching the next fashion
style for its market segment, or a radio channel is broadcasting songs in a sequence to maximize the welfare
of its audience. Let $u_{ij}(t)$ denote the utility of item $i$ to person $j$ at time $t$; let $b_{ij}(t)$ denote the
boredom value; let $\mathbf{u}_i(t)$ denote the vector of utilities to the $n$ people for item $i$, $\mathbf{v}_i$ denote the vector
of base utilities, and $\mathbf{b}_i(t)$ denote the vector of boredom values. In the absence of boredom we will say
$\mathbf{u}_i(t + 1) = A \mathbf{u}_i(t)$, where $A$ is the influence matrix. Accounting for boredom we will say
$\mathbf{u}_i(t + 1) = A \mathbf{u}_i(t) - [\mathbf{b}_i(t + 1) - \mathbf{b}_i(t)]$. Note that this is consistent with the case when there is only one individual,
where $u_i(t + 1) = u_i(t) - [b_i(t + 1) - b_i(t)]$. Observe that ignoring the effect of boredom we simply get the
recurrence $\mathbf{u}_i(t + 1) = A \mathbf{u}_i(t)$, or $\mathbf{u}_i(t) = A^t \mathbf{v}_i$. This recurrence reflects the diffusion of influence through
the social network. Note that if the largest eigenvalue of $A$ has magnitude more than 1 then the process will
diverge, and if all eigenvalues have magnitude less than 1 it will eventually converge to 0. So we will assume the maximum
eigenvalue of $A$ has magnitude 1. If the gap between the magnitudes of the largest and the second largest
eigenvalues is at least $\epsilon$, then this diffusion process converges quickly, in about $O(1/\epsilon)$ steps. We will focus on
the case when the rate of boredom $r$ is much slower than the diffusion rate (this corresponds to the case where
influences spread fast and boredom grows slowly). We then study the problem of making social choices
of items over time so as to maximize welfare.
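
To illustrate the diffusion recurrence without boredom, here is a small sketch with a hypothetical two-person influence matrix whose rows sum to 1, so that its top eigenvalue is 1 (for this particular matrix the second eigenvalue is 0.25). The matrix and the starting utilities are made-up values for illustration only.

```python
def diffuse(A, u, steps):
    """Iterate u(t + 1) = A u(t) for the given number of steps."""
    for _ in range(steps):
        u = [sum(a * x for a, x in zip(row, u)) for row in A]
    return u


# Hypothetical influence weights: person 0 weighs both people equally,
# person 1 mostly keeps their own opinion.
A = [[0.5, 0.5],
     [0.25, 0.75]]
base_utilities = [3.0, 0.0]
print(diffuse(A, base_utilities, 1))   # [1.5, 0.75]
limit = diffuse(A, base_utilities, 60)
print([round(x, 6) for x in limit])    # [1.0, 1.0]
```

Both utilities settle on a common value, here one third of person 0's starting utility plus two thirds of person 1's, and the distance to that fixed point shrinks by the second eigenvalue at every step.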

We will assume that $A$ is diagonalizable and has a real top eigenvalue of 1, and that all the other eigenvalues
are smaller in magnitude. In that case it is well known that for any vector $x$, $A^t x$ converges to a fixed
point, and the speed of convergence depends on the gap between the largest and second largest eigenvalues.
We show that under certain conditions, if $r/\epsilon$ is small, then the choices made by the society are comparable
to the choices made by an individual with appropriate base utilities and boredom coefficients. Let $W_i(t)$
denote the welfare of the society at time $t$ by choosing item $i$; then $W_i(t)/n$ is the average utility of the society.

Theorem 2.4. Consider a society with influence matrix $A$ that has largest eigenvalue 1 and second largest
eigenvalue of magnitude at most $1 - \epsilon$. For computing the welfare over a sequence of social choices
approximately, such a society can be modelled as a single individual with base utilities $v_i$ and boredom
coefficients $\alpha_i$, where $v_i = c'\mathbf{v}_i$ and $\alpha_i = c'\boldsymbol{\alpha}_i$ for some vector $c$. Let $u_i(t)$ denote the utility of item $i$ to
such an individual at time $t$.

More precisely, the difference in the average utility of the society for the same sequence of choices up to
any time satisfies $|W_i(t)/n - u_i(t)| \le (r/\epsilon)\, O(|\boldsymbol{\alpha}_i|_\infty)$ for any $t > T$, for some fixed $T$. The $O$ notation hides factors that
depend on $A$. For a real, symmetric matrix the constant is 1.
3 Technical details



3.1 Individual choice 



The following lemma is used in the proof of Theorem 2.1.

Lemma 3.1. $\sum_i M_i(t) \le 1$, and $\sum_i M_i(t) \to 1$ for large $t$. When $t = \Omega(1/r)$, $\sum_i M_i(t) = 1 - O(\exp(-tr))$.

Proof. Observe that the memory scales down by a factor of $1 - r$ each time step; exactly one item is
picked and $r$ is added to its memory. So $\sum_i M_i(t + 1) = (1 - r) \sum_i M_i(t) + r$. This recurrence gives
$\sum_i M_i(t) = (1 - r)^t \sum_i M_i(0) + r \sum_{j=0}^{t-1} (1 - r)^j = 1 + (1 - r)^t \left(\sum_i M_i(0) - 1\right)$. Since $M_i(0) = 0$, $\sum_i M_i(t) \le 1$.
Observe also that after $t = \Omega(1/r)$ steps this becomes $1 - O(\exp(-tr))$. □
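
A quick numerical check of Lemma 3.1, assuming all memories start at zero and exactly one item is picked per step: the total memory should then equal $1 - (1 - r)^t$ after $t$ steps, whatever items are picked. The pick sequence below is an arbitrary illustrative choice.

```python
def total_memory_after(picks, n_items, r):
    """Sum of all item memories after processing the given sequence of picks."""
    mem = [0.0] * n_items
    for pick in picks:
        mem = [(1 - r) * m for m in mem]  # every memory decays
        mem[pick] += r                    # the picked item gains r
    return sum(mem)


r = 0.1
picks = [0, 2, 2, 1, 0, 2, 1, 1, 0, 2] * 5   # 50 arbitrary picks among 3 items
total = total_memory_after(picks, 3, r)
closed_form = 1 - (1 - r) ** len(picks)
print(abs(total - closed_form) < 1e-12)  # True
```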



We are now ready to prove Theorem 2.1.

Proof. (Theorem 2.1) To see that the solution to the given system is unique, note that $f_i = \left(\frac{v_i - u}{\alpha_i}\right)^+$
(where $x^+$ denotes $\max(x, 0)$), and so $\sum_i \left(\frac{v_i - u}{\alpha_i}\right)^+ = 1$. This must have a unique solution, as $\sum_i \left(\frac{v_i - u}{\alpha_i}\right)^+$ is a
decreasing function of $u$ and strictly decreasing as long as the sum is positive. Let $u$ denote the solution to
the above system.

We now show $\max_i u_i(t) \ge u$ for any $t$. This is done by contradiction. Suppose that for all $i$, $u_i(t) < u$.
We have that

$$\sum_i \left(\frac{v_i - u}{\alpha_i}\right)^+ < \sum_i \frac{v_i - u_i(t)}{\alpha_i}.$$

But $\sum_i \frac{v_i - u_i(t)}{\alpha_i} = \sum_i M_i(t) \le 1$. We have that $\sum_i \left(\frac{v_i - u}{\alpha_i}\right)^+ < 1$,
a contradiction.

Let $S_g$ denote the set of all the items ever picked by the greedy algorithm. Let $T$ be the time by which
each item in $S_g$ has been used at least once. By Lemma 3.1, after some steps $\sum_i M_i(t)$ converges to
arbitrarily close to 1. Let us assume for simplicity of argument that it is exactly 1 for sufficiently large $T$.
To show the upper bound on $\max_j u_j(t)$, we show that for $t > T$ and any $i \in S_g$,
$\max_j u_j(t) - u_i(t) = O(\alpha r \log n)$.

Denote by $x(t)$ the item that has the maximum utility at time $t$. It suffices to show that $u_{x(t)}(t) \le u_i(t) + O(\alpha r \log n)$.
We recursively compute a decreasing sequence of $t_j$ as follows. Let $t_1 = t$. For $j > 1$, suppose
we have computed $t_{j-1}$. Let $S_{j-1} = \{x(t_1), x(t_2), \ldots, x(t_{j-1})\}$. Now let $t_j = \max \{t' < t_{j-1} : x(t') \notin S_{j-1}\}$.
We stop when there is $k$ such that $x(t_k) = i$. Since $t > T$, the process is guaranteed to stop. By the above
construction, we know only items in $S_{j-1}$ are picked by the greedy algorithm in the interval $[t_j + 1, t_1]$. For
any $S \subseteq \{1, \ldots, n\}$, let $A(S, t) = \sum_{\ell \in S} \frac{u_\ell(t)}{\alpha_\ell}$ and $B(S) = \sum_{\ell \in S} \frac{1}{\alpha_\ell}$. We will show that for $1 < j \le k$,

$$A(S_{j-1}, t_{j-1}) \le \frac{B(S_{j-1})}{B(S_j)} A(S_j, t_j) + r. \quad (1)$$



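First observe that

$$A(S, t) = \sum_{\ell \in S} \frac{u_\ell(t)}{\alpha_\ell} = \sum_{\ell \in S} \frac{v_\ell - \alpha_\ell M_\ell(t)}{\alpha_\ell} = \sum_{\ell \in S} \frac{v_\ell}{\alpha_\ell} - \sum_{\ell \in S} M_\ell(t). \quad (2)$$

(1) follows from the following claims.

Claim 1. $A(S_{j-1}, t_{j-1}) \le A(S_{j-1}, t_j + 1)$.

Proof. Since any item picked by the greedy algorithm in $[t_j + 1, t_{j-1}]$ is in $S_{j-1}$, we have that for
$t' \in [t_j + 1, t_{j-1})$, $\sum_{\ell \in S_{j-1}} M_\ell(t' + 1) = (1 - r) \sum_{\ell \in S_{j-1}} M_\ell(t') + r \ge \sum_{\ell \in S_{j-1}} M_\ell(t')$. The last inequality is
by $\sum_\ell M_\ell(t') \le 1$. Therefore $\sum_{\ell \in S_{j-1}} M_\ell(t_{j-1}) \ge \sum_{\ell \in S_{j-1}} M_\ell(t_j + 1)$. By (2), we have
$A(S_{j-1}, t_{j-1}) \le A(S_{j-1}, t_j + 1)$. □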
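Claim 2. $A(S_{j-1}, t_j + 1) \le A(S_{j-1}, t_j) + r$.

Proof. Since $x(t_j) \notin S_{j-1}$ is the item picked by the greedy algorithm at $t_j$,
$\sum_{\ell \in S_{j-1}} M_\ell(t_j + 1) = (1 - r) \sum_{\ell \in S_{j-1}} M_\ell(t_j)$. Thus
$\sum_{\ell \in S_{j-1}} M_\ell(t_j) - \sum_{\ell \in S_{j-1}} M_\ell(t_j + 1) = r \sum_{\ell \in S_{j-1}} M_\ell(t_j) \le r$. Again by (2),
we have $A(S_{j-1}, t_j + 1) \le A(S_{j-1}, t_j) + r$. □

Claim 3. $A(S_{j-1}, t_j) \le \frac{B(S_{j-1})}{B(S_j)} A(S_j, t_j)$.

Proof. Immediately follows from $u_{x(t_j)}(t_j) \ge u_\ell(t_j)$ for $\ell \in S_{j-1}$. □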
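Repeating (1), we have that

$$A(S_1, t_1) \le \frac{B(S_1)}{B(S_2)} A(S_2, t_2) + r \le \cdots \le \frac{B(S_1)}{B(S_k)} A(S_k, t_k) + r \sum_{j=1}^{k-1} \frac{B(S_1)}{B(S_j)}.$$

Hence, we have that

$$u_{x(t_1)}(t_1) = \alpha_{x(t_1)} A(S_1, t_1) \le \alpha_{x(t_1)} \left( \frac{B(S_1)}{B(S_k)} A(S_k, t_k) + r \sum_{j=1}^{k-1} \frac{B(S_1)}{B(S_j)} \right) = \frac{1}{B(S_k)} A(S_k, t_k) + r \sum_{j=1}^{k-1} \frac{1}{B(S_j)}.$$

Since $i = x(t_k)$, for any $i' \in S_k$, $u_{i'}(t_k) \le u_i(t_k)$. Therefore
$A(S_k, t_k) = \sum_{\ell \in S_k} \frac{u_\ell(t_k)}{\alpha_\ell} \le u_i(t_k) \sum_{\ell \in S_k} \frac{1}{\alpha_\ell} = u_i(t_k) B(S_k)$.
Since $\alpha = \max_j \alpha_j$, we have $B(S_j) \ge j/\alpha$. Hence

$$u_{x(t_1)}(t_1) \le \frac{1}{B(S_k)} A(S_k, t_k) + r \sum_{j=1}^{k-1} \frac{1}{B(S_j)} \le u_i(t_k) + \alpha r \sum_{j=1}^{k-1} \frac{1}{j} = u_i(t_k) + O(\alpha r \log n).$$

Since item $i$ is not used during the interval $[t_k + 1, t_1]$, we have $u_i(t_k + 1) \le u_i(t_1)$, and hence
$u_i(t_k) \le u_i(t_k + 1) + \alpha_i r \le u_i(t_1) + \alpha_i r$. Therefore we have that $\max_j u_j(t_1) = u_{x(t_1)}(t_1) \le u_i(t_1) + O(\alpha r \log n)$.

On the other hand, we know that there exists $i \in S_g$ such that $u_i(t) \le u$, because otherwise it would
be the case that $\sum_i \left(\frac{v_i - u}{\alpha_i}\right)^+ > \sum_i \frac{v_i - u_i(t)}{\alpha_i} = \sum_i M_i(t) = 1$, a contradiction. Hence
$\max_j u_j(t) = u + O(\alpha r \log n)$. □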



3.2 NP-hardness of item selection 

A selection $Y$ is periodic with period $T$ if for any $t$, $y(t + T) = y(t)$, where $y(t)$ is the item chosen
at time $t$. Clearly, in a periodic selection, the utility of the item chosen at time $t$ is the same as that of the one
chosen at time $t + T$. For utility maximization, it suffices to consider those items chosen in $[0, T)$. Let
$U(Y) = \sum_{t=0}^{T-1} u_{y(t)}(t)$ denote the total utility of $Y$ in $[0, T)$.

Theorem 3.2. It is NP-hard to decide, given $T$, $U^*$, and $n$ items, whether there exists an assignment $Y$
with period $T$ such that $U(Y) \ge U^*$.

Proof. The reduction is from the Regular Assignment Problem and is detailed in the appendix. □ 
3.3 Optimality of double-greedy algorithm



Using exactly the same argument as in the proof of Theorem 2.1, we have that

Lemma 3.3. There exists a time $T$ such that for any $t > T$, $\mu \le \max_i w_i(t) \le \mu + O(\alpha r \log n)$, where $\mu$ is
the unique solution to the following system:

For all items with $f_i > 0$, the quantity $v_i - 2\alpha_i f_i = \mu$; if $f_i = 0$ then $v_i \le \mu$; and $\sum_i f_i = 1$.



By using the above lemma, we can prove Theorem 2.3 as follows.

Proof. (Theorem 2.3) For $0 < f \le 1$, write $A(f) = r \cdot \frac{(1 - r)^{1/f}}{1 - (1 - r)^{1/f}}$.

Let $U^*$ be the optimal value of the following program.

$$\max\ U = \sum_i f_i (v_i - \alpha_i f_i) \quad \text{s.t.} \quad \sum_i f_i \le 1 \ \text{and}\ f_i \ge 0. \quad (3)$$

Let OPT denote the optimal average utility. We have that $\mathrm{OPT} \le U^* + \alpha r$. This is by observing that
for any $0 < f \le 1$, placing an item $1/f$ apart gives an upper bound on the utility of consuming the item
with frequency $f$. The bound is $v - \alpha A(f) \le v - \alpha (f - r)$, by observing that $A(f) \ge f - r$.

The objective of (3) is maximized when there exists $\lambda$ such that $\frac{\partial U}{\partial f_i} = \lambda$ for $f_i > 0$ and $\frac{\partial U}{\partial f_i} \le \lambda$ for
$f_i = 0$, and $\sum_i f_i = 1$. Since $\frac{\partial U}{\partial f_i} = v_i - 2\alpha_i f_i$, $\lambda$ is exactly the same as $\mu$ in the statement of Lemma 3.3.
This explains the intuition of the double-greedy heuristic: it tries to equalize the marginal utility gain of
each item. Denote the optimal solution by $f^*$. Then for $f_i^* > 0$, $v_i - 2\alpha_i f_i^* = \mu$. Hence,

$$U^* = \sum_i f_i^* (v_i - \alpha_i f_i^*) = \sum_i f_i^* (\mu + \alpha_i f_i^*) = \mu + \sum_i \alpha_i f_i^{*2}.$$
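
The spacing function $A(f)$ introduced at the start of this proof, i.e. the steady-state memory of an item just before each use when it is consumed every $1/f$ steps, can be sanity-checked numerically. The sketch below evaluates $A(f) = r(1 - r)^{1/f} / (1 - (1 - r)^{1/f})$ on a grid and confirms that it lies between $f - r$ and $f$. The function names, the grid, and the tolerance are arbitrary choices for illustration.

```python
def spacing_memory(f, r):
    """Memory just before each use when an item is used every 1/f steps."""
    q = (1 - r) ** (1 / f)
    return r * q / (1 - q)


def bounds_hold(r, grid=1000, tol=1e-12):
    """Check f - r <= A(f) <= f for f = 1/grid, 2/grid, ..., 1."""
    for k in range(1, grid + 1):
        f = k / grid
        value = spacing_memory(f, r)
        if not (f - r - tol <= value <= f + tol):
            return False
    return True


print(all(bounds_hold(r) for r in (0.01, 0.1, 0.5, 0.9)))  # True
```

The lower bound is tight at $f = 1$, where the memory just before each use is exactly $1 - r$.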

i i % 

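Let $k_i$ denote the number of times item $i$ is used in $[0, T]$ by the double-greedy algorithm, and $f_i = k_i/T$.
Let $\bar{M}_i$ denote the average memory on $i$ at the times when $i$ is picked, and let $x(t)$ denote the item picked at time $t$. Then we have that

$$U = \sum_{t=0}^{T} \frac{u_{x(t)}(t)}{T} = \sum_i \sum_{x(t)=i} \frac{u_i(t)}{T} = \sum_i \sum_{x(t)=i} \frac{w_i(t) + \alpha_i M_i(t)}{T}$$

$$\ge \sum_i \sum_{x(t)=i} \frac{\mu + \alpha_i M_i(t)}{T} \quad \text{(by Lemma 3.3, } w_i(t) \ge \mu \text{)}$$

$$\ge \mu + \sum_i \alpha_i f_i \bar{M}_i. \quad (4)$$

Write $\delta = \alpha r \log n$. By Lemma 3.3, $w_i(t) = \mu + O(\delta)$ for each $i, t$. We will show that

Claim 1. $\alpha_i f_i = \alpha_i f_i^* - O(\delta)$. Observe that for any item $i$ which is picked $k_i$ times in $[0, T]$,
$\min_{0 \le t \le T} M_i(t) \le A(k_i/T) \le k_i/T = f_i$. Hence, $\max_t w_i(t) \ge v_i - 2\alpha_i \min_t M_i(t) \ge v_i - 2\alpha_i f_i$. On the other hand,
$w_i(t) = \mu + O(\delta)$. We have $v_i - 2\alpha_i f_i \le \mu + O(\delta)$. But $\mu = v_i - 2\alpha_i f_i^*$. Therefore $\alpha_i f_i \ge \alpha_i f_i^* - O(\delta)$.

Claim 2. $\alpha_i \bar{M}_i = \alpha_i f_i^* - O(\delta)$. Since $v_i - 2\alpha_i \bar{M}_i \le \mu + \delta$, we obtain the bound by following the same
argument as in the proof of Claim 1. Now, plugging both claims into (4), we have that

$$U \ge \mu + \sum_i \alpha_i f_i \bar{M}_i \ge \mu + \sum_i \alpha_i f_i^{*2} - O(\delta) = U^* - O(\delta) \ge \mathrm{OPT} - O(\delta).$$

The last inequality follows from $U^* \ge \mathrm{OPT} - \alpha r$. This completes the proof. □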
Song                         v      α      r
The Climb                    12.3   9.9    0.097
Lucky                        2.6    1.58   0.114
Snow Patrol - Chasing Cars   10.7   6.8    0.127
I know you want me           7.95   6.5    0.077
Viva la vida                 12.4   9.1    0.16
Stop and stare               10.5   9.4    0.092
Disturbia                    8      7.2    0.092
Pocket full of sunshine      7.6    6.3    0.14
Supernatural superserious    24.2   22     0.15
One step at a time           9.35   8.5    0.075

Table 1: v, α, r for the set of songs.

Movie               v      α      r
Godfather           6.15   5.15   0.123
Hancock             9.6    8.8    0.128
The Bucket List     13.1   11.8   0.102
Quantum of Solace   29.8   29     0.111
Tropic Thunder      25.6   24.8   0.082

Table 2: v, α, r for the set of movies.



3.4 Social Choice is equivalent to individual choice 

Let 1 denote the vector with all coordinates set to 1 and a, denote the vector of boredom coefficients a^-. 

Observation. For any diagonolizable matrix A with largest eigenvalue 1 and the second largest eigenvalue 
is at most 1 — e, there is a vector c so that. l'A t x — c'x < (1 — e^y/nO^x^). The O notation hides factors 
that depends on A. For a real, symmetric matrix the constant is 1. 

Proof. We sketch the proof for real symmetric matrices; the same idea holds for non-symmetric
matrices. If $p_1, \ldots, p_n$ denote the orthonormal eigenvectors of $A$ and $1 = \lambda_1, \ldots, \lambda_n$
denote the eigenvalues, then

$$A^t x = \sum_j \lambda_j^t p_j p_j' x = p_1 p_1' x + \sum_{j>1} \lambda_j^t p_j p_j' x.$$

Now $\left|\sum_{j>1} \lambda_j^t p_j p_j' x\right|_2 \le (1 - \epsilon)^t |x|_2$. So
$|\mathbf{1}'(A^t x - p_1 p_1' x)| \le |\mathbf{1}|_2 (1 - \epsilon)^t |x|_2 = \sqrt{n} (1 - \epsilon)^t |x|_2$.
Setting $c' = \mathbf{1}' p_1 p_1'$ completes the proof. □
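
The bound in the Observation can be checked numerically for the symmetric case. The sketch below is only an illustration: the helper names `make_symmetric` and `gap_and_bound`, the chosen eigenvalues, and the test vector are assumptions introduced here, not part of the paper. It builds a symmetric matrix with largest eigenvalue 1, takes $c' = \mathbf{1}' p_1 p_1'$, and compares $|\mathbf{1}' A^t x - c'x|$ with $\sqrt{n}(1 - \epsilon)^t |x|_2$, where $1 - \epsilon$ is taken to be the largest absolute value among the remaining eigenvalues.

```python
import numpy as np


def make_symmetric(eigenvalues, seed=0):
    """Build a real symmetric matrix with the given eigenvalues (hypothetical test input)."""
    rng = np.random.default_rng(seed)
    n = len(eigenvalues)
    q, _ = np.linalg.qr(rng.standard_normal((n, n)))  # random orthonormal eigenvectors
    a = q @ np.diag(eigenvalues) @ q.T
    return (a + a.T) / 2  # remove floating-point asymmetry


def gap_and_bound(a, x, t):
    """Return (|1'A^t x - c'x|, sqrt(n) (1 - eps)^t |x|_2) for a symmetric matrix a."""
    n = a.shape[0]
    eigvals, eigvecs = np.linalg.eigh(a)  # ascending eigenvalues, orthonormal eigenvectors
    p1 = eigvecs[:, -1]  # eigenvector of the largest eigenvalue (assumed to be 1)
    second = max(abs(eigvals[0]), abs(eigvals[-2]))  # largest remaining |eigenvalue|, i.e. 1 - eps
    ones = np.ones(n)
    c = p1 * (p1 @ ones)  # c = p1 p1' 1, so that c' = 1' p1 p1'
    gap = abs(ones @ np.linalg.matrix_power(a, t) @ x - c @ x)
    bound = np.sqrt(n) * second**t * np.linalg.norm(x)
    return gap, bound


if __name__ == "__main__":
    a = make_symmetric([1.0, 0.6, -0.4, 0.2])
    x = np.array([3.0, -1.0, 2.0, 0.5])
    for t in (0, 5, 10, 20):
        print(t, *gap_and_bound(a, x, t))
```

The gap shrinks geometrically in $t$, which is the property the proof of Theorem 2.4 relies on.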

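We will now prove Theorem 2.4.

Proof (Theorem 2.4). Let $\Delta b_i(t)$ denote $b_i(t) - b_i(t-1)$. Now $u_i(t) = A u_i(t-1) + \Delta b_i(t)$.
This gives

$$u_i(t) = A^t v_i + \sum_{j=0}^{t-1} A^j \Delta b_i(t-j).$$

Note that

$$W_i(t) = \mathbf{1}' u_i(t) = \mathbf{1}' A^t v_i + \sum_{j=0}^{t-1} \mathbf{1}' A^j \Delta b_i(t-j).$$

Note $\Delta b_{ij}(t) = \alpha_{ij}\left((1 - r) M_{ij}(t) + r x_i(t) - M_{ij}(t)\right) = \alpha_{ij} r (x_i(t) - M_{ij}(t))$.
So $|\Delta b_i(t)|_2 \le r |\alpha_i|_2$.
Now $|\mathbf{1}' A^t v_i - c' v_i| \le (1 - \epsilon)^t \sqrt{n}\, O(|v_i|_2)$. For $t > (1/\epsilon)\, O(\log(n r |v_i|_2))$, this
is at most $r$. Also
$|\mathbf{1}' A^j \Delta b_i(t-j) - c' \Delta b_i(t-j)| \le (1 - \epsilon)^j \sqrt{n}\, O(|\Delta b_i(t-j)|_2) \le (1 - \epsilon)^j \sqrt{n}\, r\, O(|\alpha_i|_2)$.
So $\left|\sum_j \mathbf{1}' A^j \Delta b_i(t-j) - \sum_j c' \Delta b_i(t-j)\right| \le (r/\epsilon) \sqrt{n}\, O(|\alpha_i|_2)$.
So $\left|\sum_j \mathbf{1}' A^j \Delta b_i(t-j) - c' b_i(t)\right| \le (r/\epsilon) \sqrt{n}\, O(|\alpha_i|_2)$.
Therefore $|W_i(t) - (c' v_i - c' b_i(t))| \le (r/\epsilon)\, O(\sqrt{n}\, |\alpha_i|_2)$. Dividing by $n$ completes the proof. □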



4 Experiments 

In this section, we provide experimental results to study the techniques presented in the paper. Our primary
objective is to evaluate the quality of the greedy and double-greedy algorithms for choosing items, based on
utility and boredom parameters estimated from real data.

4.1 Setup 

We obtain data on the popularity of songs and movies from Google Trends [1]. We collected weekly aggregate
counts from query logs for popular songs from the last 3 years. Similar data was collected for popular
movies. While the popularity of songs and movies depends on additional factors such as awards won by an
album or a movie, our goal was to perform a controlled experiment based only on overall utility and boredom.
Therefore, for each item we collected weekly aggregate counts starting from the highest peak in the logs
until there was an "artificial peak" due to an external event such as an award. Further, we compare the utility
obtained by our model with a baseline in which the user selects an item simply based on its utility, without
any discounting from boredom. We describe how we compute the values of $\alpha$, $v$, and $r$ in the appendix.

Song                          Greedy         Double-Greedy
The Climb                     11.17(0.17)    11.11(0.17)
Snow Patrol - Chasing Cars    -(0)           10.24(0.12)
Viva la vida                  11.61(0.17)    11.24(0.21)
Stop and stare                -(0)           10.08(0.08)
Supernatural superserious     12.22(0.67)    17.52(0.40)

Table 4: Avg. utilities (frequencies) for selected songs.

Movie                         Greedy         Double-Greedy
Hancock                       -(0)           9.56(0.04)
The Bucket List               -(0)           11.40(0.20)
Quantum of Solace             16.41(0.55)    20.37(0.40)
Tropic Thunder                15.77(0.45)    18.01(0.36)

Table 5: Avg. utilities (frequencies) for selected movies.



Table 1 shows the $v$, $\alpha$, and $r$ values for the set of 10 songs used in our experiments, while the
corresponding data for the movie data set is shown in Table 2. Here we allow different values of $r$, but we
notice that all $r$-values within the domain of songs and movies are similar.



4.2 Results 

We ran a set of experiments to verify the effectiveness of the greedy and double-greedy heuristics. We ran
the experiments over 100000 steps for both data sets. The average utility obtained by the user for both
data sets was computed and is shown in Table 3. We also show results for the baseline approach that
always picks the item with the highest base utility. Tables 4 and 5 show the average utility obtained
by the user over the selected songs and movies, respectively. The corresponding normalized frequencies are
shown in parentheses. As expected, in the baseline case where the user selects an item according to its base
utility, the movie Quantum of Solace (with a base utility of 29.8) is always selected, while in the case
of songs, the song Supernatural superserious (with a base utility of 24.2) is selected. Unsurprisingly,
the average utility after discounting boredom for this case is very low (see Table 3).



Dataset    Greedy    Double-Greedy    Baseline
Songs      11.94     13.53            5.62
Movies     16.12     17.30            4.25

Table 3: Average utility over 100000 time steps.



Figure 2: Change in average utility over time (curves: Double-Greedy, Baseline; x-axis: number of steps).

In another experiment, we measured the change in the average utility with time. Figure 2 illustrates
the change in average utility as the user selects different items at each time step for movies. Naturally, the
utility is highest at the very beginning, when the user picks the item with the highest base utility, and
decreases subsequently as she picks the item with the highest discounted utility at each time step.
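
The comparison in this section can be sketched in a few lines. The code below is a minimal simulation under stated assumptions, not the authors' implementation: it assumes the memory update $M_i(t+1) = (1 - r_i) M_i(t) + r_i x_i(t)$ (the form that appears in the proof of Theorem 2.4), composite utility $v_i - \alpha_i M_i(t)$, a greedy rule that maximizes the composite utility, a double-greedy rule that maximizes $v_i - 2\alpha_i M_i(t)$, and a baseline that maximizes $v_i$. The function name `simulate` and the policy labels are hypothetical. It uses the movie parameters from Table 2 and is not meant to reproduce the numbers in Table 3.

```python
import numpy as np

# (v, alpha, r) for the movies, copied from Table 2.
MOVIES = {
    "Godfather": (6.15, 5.15, 0.123),
    "Hancock": (9.6, 8.8, 0.128),
    "The Bucket List": (13.1, 11.8, 0.102),
    "Quantum of Solace": (29.8, 29.0, 0.111),
    "Tropic Thunder": (25.6, 24.8, 0.082),
}

# Multiplier applied to the boredom term when ranking items (assumed policy definitions).
BOREDOM_WEIGHT = {"greedy": 1.0, "double": 2.0, "baseline": 0.0}


def simulate(items, policy, steps=10_000):
    """Return (average composite utility, per-item selection frequencies)."""
    names = list(items)
    v, alpha, r = (np.array(col, dtype=float) for col in zip(*items.values()))
    weight = BOREDOM_WEIGHT[policy]
    memory = np.zeros(len(names))  # boredom memory M_i, starts at 0
    counts = np.zeros(len(names))
    total = 0.0
    for _ in range(steps):
        choice = int(np.argmax(v - weight * alpha * memory))
        # The utility actually collected is always the composite utility v - alpha * M.
        total += v[choice] - alpha[choice] * memory[choice]
        counts[choice] += 1
        x = np.zeros(len(names))
        x[choice] = 1.0
        memory = (1.0 - r) * memory + r * x  # assumed exponential-decay update
    return total / steps, dict(zip(names, counts / steps))


if __name__ == "__main__":
    for policy in ("greedy", "double", "baseline"):
        avg, freq = simulate(MOVIES, policy)
        print(policy, round(avg, 2), {k: round(f, 2) for k, f in freq.items()})
```

Under these assumptions the baseline always selects Quantum of Solace, so its collected utility decays toward $v - \alpha$, while the two boredom-aware rules rotate among items.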



5 Future Work 

As we mentioned, our model is by no means comprehensive. For example, boredom may come from consuming
similar items, or there may be a cost when switching from item to item. Taking these factors into account
raises some interesting algorithmic issues. Fully incorporating these extensions is left as future work.

A Computing the model parameters

Figure 3 shows the trend observed for a specific song from our dataset, I Know You Want Me, over
a 45-week period starting August 2, 2009. The first natural observation we make is that the total number
of queries does indeed display a steady decline, which we attribute to boredom. From the data, we use the
maximum count as the peak utility, $v_{peak}$, and let the final count be denoted $v_{final}$. We set
$\alpha = v_{peak} - v_{final}$. Let $X(t)$ denote the aggregate count for week $t$. We obtain the boredom
parameter $r$ using the following equation:

$$e^{-rt} = 1 - \frac{v_{peak} - X(t)}{\alpha}$$

We plot $rt = -\ln\left(1 - \frac{v_{peak} - X(t)}{\alpha}\right)$, fit a line to the resulting curve, and obtain $r$
from the slope. Figure 4 shows the curve for I Know You Want Me, from which we obtain the $r$ value.
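
The fitting procedure described above can be sketched as follows. This is an illustration under assumptions rather than the code used for the paper: it assumes the series starts at its peak, that the fitted quantity is $-\ln(1 - (v_{peak} - X(t))/\alpha)$ with $\alpha = v_{peak} - v_{final}$, and that weeks where the argument of the logarithm is not positive are dropped. The function name `estimate_r` and the optional `alpha` argument are hypothetical.

```python
import numpy as np


def estimate_r(counts, alpha=None):
    """Estimate the boredom parameter r from weekly counts that start at the peak."""
    counts = np.asarray(counts, dtype=float)
    v_peak = counts.max()
    if alpha is None:
        alpha = v_peak - counts[-1]  # alpha = v_peak - v_final
    weeks = np.arange(len(counts))
    ratio = 1.0 - (v_peak - counts) / alpha
    keep = ratio > 0  # the logarithm is undefined once the count reaches v_final
    y = -np.log(ratio[keep])  # should be close to r * t if the model holds
    slope, _intercept = np.polyfit(weeks[keep], y, 1)  # least-squares line
    return slope


if __name__ == "__main__":
    # Synthetic series generated from the assumed model, for illustration only.
    t = np.arange(45)
    series = 7.95 - 6.5 * (1.0 - np.exp(-0.077 * t))
    print(estimate_r(series, alpha=6.5))
    print(estimate_r(series))
```

When $\alpha$ is taken from the last observation rather than from the true asymptote, the final week is excluded from the fit and the estimate is only approximate.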



B NP-hardness of item selection 



Restatement of Theorem 3.2: It is NP-hard to decide, given $T$, $U^*$, and $n$ items, whether there exists an
assignment $Y$ with period $T$ such that $U(Y) \ge U^*$.

Proof. The reduction is from the following problem. 

Figure 3: Weekly aggregate query counts for I Know You Want Me over a 45-week period, from Google Trends
(y-axis: aggregate count; x-axis: week).

Figure 4: Approximately linear trend for I Know You Want Me, with slope giving $r$ (x-axis: week).



Regular assignment problem (RAP). Given positive integers $p_1, p_2, \ldots, p_n$, determine if there exists a
sequence $y_0, y_1, \ldots$ where $y_t \in \{0, 1, \ldots, n\}$ such that for any $i \ne 0$, two consecutive
appearances of $i$ in the sequence are exactly $p_i$ apart.

It is shown in [2] that the regular assignment problem is NP-complete. Note that for RAP, a regular
assignment exists if and only if it does so on a cycle with length $T = \prod_i p_i$. We will now reduce it to the
optimal fashion selection problem.
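
To make the problem concrete, the sketch below searches for a regular assignment on small instances and builds the corresponding cycle of length $T = \prod_i p_i$. It is an illustration only, with several assumptions that are not taken from the paper: every item $i \ge 1$ is required to appear, item $i$ occupies the positions $o_i, o_i + p_i, o_i + 2p_i, \ldots$ for some offset $o_i$, and two items collide exactly when $o_i \equiv o_j \pmod{\gcd(p_i, p_j)}$ (a standard consequence of the Chinese remainder theorem). The function names are hypothetical, and the backtracking search takes exponential time in the worst case, as one would expect for an NP-complete problem.

```python
from math import gcd


def find_offsets(periods):
    """Backtracking search for offsets o_i in [0, p_i) with pairwise disjoint position sets."""
    offsets = []

    def compatible(i, o):
        # Items i and j share a position iff their offsets agree modulo gcd(p_i, p_j).
        return all((o - offsets[j]) % gcd(periods[i], periods[j]) != 0 for j in range(i))

    def extend(i):
        if i == len(periods):
            return True
        for o in range(periods[i]):
            if compatible(i, o):
                offsets.append(o)
                if extend(i + 1):
                    return True
                offsets.pop()
        return False

    return list(offsets) if extend(0) else None


def build_cycle(periods, offsets):
    """Lay the items out on a cycle of length T = product of the periods; 0 is the filler."""
    length = 1
    for p in periods:
        length *= p
    cycle = [0] * length
    for i, (p, o) in enumerate(zip(periods, offsets), start=1):
        for t in range(o, length, p):
            cycle[t] = i
    return cycle


def is_regular(cycle, periods):
    """Check that consecutive appearances of each item i >= 1 are exactly p_i apart (cyclically)."""
    length = len(cycle)
    for i, p in enumerate(periods, start=1):
        positions = [t for t, y in enumerate(cycle) if y == i]
        if not positions:
            return False  # assumption: every item must appear
        for a, b in zip(positions, positions[1:] + positions[:1]):
            gap = (b - a) % length or length  # a single appearance wraps around the whole cycle
            if gap != p:
                return False
    return True


if __name__ == "__main__":
    for periods in ([2, 4, 4], [2, 3], [2, 2, 2]):
        offsets = find_offsets(periods)
        print(periods, offsets, offsets is not None and is_regular(build_cycle(periods, offsets), periods))
```

For example, periods 2 and 3 admit no regular assignment because the two items must eventually collide, whereas periods 2, 4, 4 do.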

Given $p_1, \ldots, p_n$, we create $n + 1$ items such that a regular assignment, if one exists, maximizes the
utility of any periodic selection with period $T$. Hence we can reduce RAP to the optimal selection problem.
Item 0 is a special item with $v_0 = 1$ and $\alpha_0 = 0$. For $1 \le i \le n$, we assign
$v_i = \frac{\ldots}{p_i}$ and $\alpha_i = 1$. Further let $r_i = 1/T$ for $1 \le i \le n$. We claim that there
exist $U^*$ and $\epsilon \ge 1/T^2$ such that for a regular assignment $Y$, $U(Y) \ge U^*$, and
$U(Y) < U^* - \epsilon$ otherwise.

Consider the case when there is only one item and the selections are made on the real line. Given $T$
and an item with parameters $v, \alpha, r$, let $\mathcal{Y}_k(v, \alpha, r)$ be the set of all selections which
have period $T$ and choose the item exactly $k$ times on the real interval $[0, T)$. Denote
$U_k(v, \alpha, r) = \max_{Y \in \mathcal{Y}_k(v, \alpha, r)} U(Y)$ and
$\delta U_k(v, \alpha, r) = U_k(v, \alpha, r) - U_{k-1}(v, \alpha, r)$. The correctness of the reduction follows
from the following claims.

Claim 1. $U_k(v, \alpha, r) = kv - k\alpha \frac{e^{-rT/k}}{1 - e^{-rT/k}}$, and the maximum is achieved with the regular assignment.

Claim 2. For $1 \le i \le n$, $U_k(v_i, 1, 1/T) = k v_i - \left(k^2 - \frac{1}{2}k + \frac{1}{12} + o(1/k)\right)$, and
$\delta U_k(v_i, 1, 1/T) = v_i - \left(2k - \frac{3}{2} + o(1/k)\right)$.

Claim 3. For any non-regular integral selection $Y \in \mathcal{Y}_k(v_i, 1, 1/T)$, $U(Y) \le U_k(v_i, 1, 1/T) - 1/T^2$.

Claim 1 holds because the total memory is minimized when the k assignments are regularly spaced. 
Claim 2 is a direct consequence of Claim 1 by Taylor expansion on those particular parameters. Claim 3 
follows by comparing the memory caused by adjacent items between regular and non-regular assignments. 

From Claim 2, we can see that $\delta U_k(v_i, 1, 1/T) > 3/2$ for $k \le T/p_i$ and $< \ldots$ for
$k \ge T/p_i + 1$, for $1 \le i \le n$, and $\delta U_k(v_0, 0, 1/T) = 1$. Combining this with Claim 3, we have
that the utility gap between a regular and a non-regular assignment is at least $1/T^2$. Therefore the
reduction is correct and can be done in polynomial time.

□ 